More on arbitrary boundary packed arithmetic
نویسندگان
چکیده
Recent microprocessors have been enhanced with media instruction sets for accelerating media algorithms. They exploit the fact that media algorithms have small data types, and widths much less than that of the processor. Current media instruction sets support only 8-, 16and 32-bit subdatatypes. This scheme is ineffecient in several applications where bit lengths of 9, 12 and so on are used. We need user programmable sub-datatype bit lengths. [1] discusses arbitrary boundary packed addition. Many media algorithms are based on multiplyaccumulate algorithms. For full acceleration we also need arbitrary boundary packed multiplication. We present such a scheme based on Wallace tree multiplication. We also expand on [1] and provide a detailed treatment of the intermediate carries of sub-datatypes which were lost in the previous work. These carries could be used for saturation arithmetic and flow control.
منابع مشابه
Arbitrary Precision Arithmetic - SIMD Style
Current day general purpose processors have been enhanced with what is called " media instruction set " t o achieve performance gains in applications that are media processing intensive. The instruction set that have been added exploit the fact that media applications have small native datatypes and have widths much less than that supported by commercial processors and the plethora of data-para...
متن کاملTransient thermoelastic analysis of FGM rotating thick cylindrical pressure vessels under arbitrary boundary and initial conditions
Assuming arbitrary boundary and initial conditions, a transient thermo-elastic analysis of a rotating thick cylindrical pressure vessel made of functionally graded material (FGM) subjected to axisymmetric mechanical and transient thermal loads is presented. Time-dependent thermal and mechanical boundary conditions are assumed to act on the boundaries of the vessel. Material properties of the ve...
متن کاملRobust Construction of 3-D Conforming Delaunay Meshes Using Arbitrary-Precision Arithmetic
An algorithm for the construction of 3-D conforming Delaunay tetrahedralizations is presented. The boundary of the meshed domain is contained within Voronoï cells of the boundary vertices of the resulting mesh. The algorithm is explained heuristically. It has been implemented. The problem of numerical precision is shown to be a major obstacle to robust implementation of the algorithm. The Autom...
متن کاملExact analytical approach for free longitudinal vibration of nanorods based on nonlocal elasticity theory from wave standpoint
In this paper, free longitudinal vibration of nanorods is investigated from the wave viewpoint. The Eringen’s nonlocal elasticity theory is used for nanorods modelling. Wave propagation in a medium has a similar formulation as vibrations and thus, it can be used to describe the vibration behavior. Boundaries reflect the propagating waves after incident. Firstly, the governing quation of nanoro...
متن کاملThermal Development for Ducts of Arbitrary Cross Sections by Boundary-Fitted Coordinate Transformation Method
The non-orthogonal boundary-fitted coordinate transformation method is applied to the solution of steady three-dimensional momentum and energy equations in laminar flow to obtain temperature field and Nusselt numbers in the thermal entry region of straight ducts of different cross sectional geometries. The conservation equations originally written in Cartesian coordinates are parabolized in the...
متن کامل